Rank | Count | Beginning |
---|---|---|
83886 | 7551 | У |
10218 | 3644 | В |
48452 | 2573 | На |
29224 | 2251 | З |
30286 | 1418 | За |
63037 | 1371 | Після |
15432 | 1046 | Він |
25859 | 974 | До |
94160 | 909 | Це |
25134 | 676 | Для |
5590 | 674 | Але |
38786 | 652 | Історія |
68141 | 650 | При |
61962 | 584 | Під |
40082 | 517 | Його |
80026 | 506 | Також |
70135 | 444 | Проте |
56492 | 425 | Однак |
18249 | 410 | Вони |
43803 | 400 | Крім |
99067 | 392 | Як |
8479 | 383 | Біографія |
52656 | 381 | Населення |
17877 | 365 | Вона |
97015 | 360 | Через |
4579 | 358 | А |
96283 | 325 | Ця |
99531 | 316 | Якщо |
94606 | 313 | Цей |
37194 | 309 | І |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N=1, 2, 3, 4. This subsection starts with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
For N=1 in particular, even a small corpus is enough to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
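The same count can be sketched outside the database. The following Python snippet is a minimal illustration, not the code used to produce the table above: the function name `sentence_beginnings` and the sample sentences are hypothetical, and it assumes the sentences are available as a plain list of strings. It mimics `substring_index(sentence, ' ', N)` by keeping everything before the N-th space, so it also covers the cases N=2, 3, 4.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent beginnings made of the first n words.

    Hypothetical helper: words are delimited by single spaces, as in the
    SQL call substring_index(sentence, ' ', n).
    """
    counts = Counter()
    for sentence in sentences:
        # Keep everything before the n-th space; a sentence with fewer
        # than n spaces is kept whole, as substring_index would do.
        beginning = " ".join(sentence.split(" ")[:n])
        counts[beginning] += 1
    # Sort by descending count and cut off, like ORDER BY cnt DESC LIMIT 50.
    return counts.most_common(limit)


if __name__ == "__main__":
    # Invented sample sentences, for illustration only.
    sample = ["У місті є парк.", "У селі є школа.", "Він живе тут."]
    for beginning, count in sentence_beginnings(sample, n=1):
        print(count, beginning)
```

The order of beginnings with equal counts is not defined by the SQL query; in this sketch, ties appear in the order in which they were first seen.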